Abstract

The jackknife variance estimator and the infinitesimal jackknife variance estimator
are shown to be asymptotically equivalent if the functional of interest is a smooth function
of the mean or a trimmed L-statistic with Hölder continuous weight function.
1 Introduction

This note concerns the asymptotic behavior of the jackknife variance estimator $v_{jack}$, especially
regarding its relationship to the infinitesimal jackknife variance estimator $v_{ijack}$. We consider,
in particular, the variance estimates $v_{jack}$ and $v_{ijack}$ for smoothly trimmed L-statistics and for
smooth functions of the sample mean. We prove that $v_{jack}$ and $v_{ijack}$ are often asymptotically
equivalent to one another in the sense that

(1.1) $v_{jack} - v_{ijack} = O_p(n^{-h})$

for some h > 0. The equivalence of $v_{jack}$ and $v_{ijack}$ can sometimes be used to prove that $v_{jack}$ is
asymptotically normal but, remarkably, it also holds even when these estimators are not
asymptotically normal.

These issues are relevant to the following scenario of statistical practice: One wishes to
estimate some functional T(p) of an unknown population distribution p, and, to this end, one
draws n samples from the population and uses $T(\epsilon_n)$ to estimate T(p), where $\epsilon_n$ is the empirical
distribution of the n samples. One would then like an estimate of the sampling variance of $T(\epsilon_n)$.
Two widely used nonparametric estimates of this variance are $v_{jack}$ and $v_{boot}$, the jackknife and
bootstrap variance estimates. Having obtained one of these estimates, one naturally desires to
know how accurate it is. To assess the accuracy of the usual Monte Carlo approximation of $v_{boot}$,
one may use the jackknife-after-bootstrap technique of Efron (1992). This note concerns the
asymptotic behavior of $v_{jack}$ and a closely related estimator $v_{ijack}$, the infinitesimal jackknife.

Beran (1984) showed that $v_{jack}$, $v_{ijack}$, and $v_{boot}$ are asymptotically equivalent, and
asymptotically normal, if the functional T(p) has a well-behaved second-order functional derivative.
While the proof of Beran (1984) requires a strong statement of the Dvoretzky-Kiefer-Wolfowitz
inequality to handle the asymptotics of $v_{boot}$, the asymptotic equivalence of $v_{jack}$ and $v_{ijack}$ is
easier to prove, and does not require second-order differentiability of T. Indeed, we shall see
that $v_{jack}$ and $v_{ijack}$ are asymptotically equivalent if T has a well-behaved first-order derivative
and $v_{ijack}$ is consistent as an estimator of the variance of $T(\epsilon_n)$.

The equivalence of $v_{jack}$ to $v_{ijack}$ can help one to determine the asymptotic variance of the
former. For example, Gardiner and Sen (1979) have carefully studied the asymptotic normality
of $v_{ijack}$ in the context of L-statistics. We shall see in Section 4 that their work establishes the
asymptotic normality of $v_{jack}$ too, thanks to the equivalence of $v_{jack}$ and $v_{ijack}$ for variance
estimation of L-statistics.

Sometimes $v_{jack}$ and $v_{ijack}$ are asymptotically equivalent even when they are not
asymptotically normal. Consider, for example, the estimation of the variance of a function g of the
sample mean. If g is once, but not twice, continuously differentiable, then $v_{jack}$ and $v_{ijack}$ may
not be asymptotically normal, yet they will satisfy (1.1) as long as g' is Hölder continuous of
order h.

After the necessary definitions are presented in the next section, we prove in Section 3 that
$v_{jack}$ and $v_{ijack}$ are asymptotically equivalent as estimators of the variance of smooth functions
of the sample mean. In Section 4 we discuss the asymptotic normality of $v_{jack}$ as an estimator
of the variance of trimmed L-statistics.

2 Background and definitions 

Let p be a probability measure on a sample space X. Given n samples from X, sampled
independently under the probability law p, one desires to estimate the value T(p) of some real
functional T on the space $\mathcal{P}(X)$ of all probability measures on X. Denote by $\epsilon_n$ the map that
converts n data points $x_1, x_2, \ldots, x_n$ into the empirical measure

(2.1) $\epsilon_n(x_1, x_2, \ldots, x_n) = \frac{1}{n} \sum_{i=1}^{n} \delta(x_i)$

where $\delta(x_i)$ denotes a point-mass at $x_i$. The plug-in estimate of T(p) given the data
$x = (x_1, \ldots, x_n)$ is

(2.2) $T_n = T(\epsilon_n(x))$.

Suppose $T_n$ is an asymptotically normal estimator of T(p), i.e., suppose the distribution of
$n^{1/2}(T_n - T(p))$ tends to $\mathcal{N}(0, \sigma^2)$. The jackknife is a computational technique for estimating
$\sigma^2$: one transforms the n original data points into n pseudovalues and computes the sample
variance of those pseudovalues.

Given the data $x = (x_1, x_2, \ldots, x_n)$, the jackknife pseudovalues are

$Q_{ni} = n T(\epsilon_n) - (n-1) T(\epsilon_{ni})$,  i = 1, 2, ..., n

with $\epsilon_n$ as in (2.1) and

(2.3) $\epsilon_{ni} = \frac{1}{n-1} \sum_{j \ne i} \delta(x_j)$.

The jackknife variance estimator is

(2.4) $v_{jack}(x_1, x_2, \ldots, x_n) = \frac{1}{n-1} \sum_{i=1}^{n} (Q_{ni} - \bar{Q}_n)^2$

where $\bar{Q}_n = \frac{1}{n} \sum_{j=1}^{n} Q_{nj}$. The variance estimator $v_{jack}$ is said to be consistent if $v_{jack} \to \sigma^2$
almost surely as $n \to \infty$. Sufficient conditions for the consistency of $v_{jack}$ are given in terms of
the functional differentiability of T. An early result of this kind states that $v_{jack}$ is consistent
if T is strongly Fréchet differentiable (Parr (1985)), and it is now known that $v_{jack}$ is consistent
even if T is only continuously Gâteaux differentiable as defined in Shao (1993).
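
The definitions (2.3) and (2.4) translate directly into a leave-one-out computation. The following is a minimal sketch in Python, not taken from the original note: the names `jackknife_variance` and `sample_mean` are hypothetical, and the statistic is assumed to be supplied as a function of the list of data points.

```python
from typing import Callable, List, Sequence


def jackknife_variance(x: Sequence[float],
                       stat: Callable[[List[float]], float]) -> float:
    """Jackknife estimate of sigma^2 following (2.4); `stat` plays the role of T."""
    data = list(x)
    n = len(data)
    t_full = stat(data)
    # Pseudovalues Q_ni = n*T(eps_n) - (n-1)*T(eps_ni), where eps_ni omits x_i.
    q = [n * t_full - (n - 1) * stat(data[:i] + data[i + 1:]) for i in range(n)]
    q_bar = sum(q) / n
    # Sample variance of the pseudovalues, with divisor n - 1 as in (2.4).
    return sum((qi - q_bar) ** 2 for qi in q) / (n - 1)


def sample_mean(values: List[float]) -> float:
    """Illustrative statistic: the plug-in estimate of the mean."""
    return sum(values) / len(values)


if __name__ == "__main__":
    print(jackknife_variance([2.0, 4.0, 4.0, 5.0, 7.0, 9.0], sample_mean))
```

When the statistic is the sample mean, each pseudovalue $Q_{ni}$ reduces to $x_i$, so the sketch returns the ordinary unbiased sample variance of the data.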
A functional derivative of T at p, denoted $\partial T_p$, is a linear functional that best approximates
the behavior of T near p in some sense. For instance, a functional T on the space of bounded
signed measures $\mathcal{M}(X)$ is Gâteaux differentiable at p if there exists a continuous linear functional
$\partial T_p$ on $\mathcal{M}(X)$ such that

$\lim_{t \to 0} \left| t^{-1} (T(p + tm) - T(p)) - \partial T_p(m) \right| = 0$

for all $m \in \mathcal{M}(X)$. The concept of Hadamard differentiability is more relevant to statistical
asymptotics, for the fluctuations of $T(\epsilon_n)$ about T(p) are asymptotically normal if T is
Hadamard differentiable at p. A functional $T : \mathcal{P}(\mathbb{R}) \to \mathbb{R}$ is Hadamard differentiable at p if
there exists a continuous linear functional $\partial T_p$ on $\mathcal{M}(\mathbb{R})$ such that

$\lim_{t \to 0} \left| t^{-1} (T(p + t m_t) - T(p)) - \partial T_p(m) \right| = 0$

whenever $\{m_t\}_{t \in \mathbb{R}}$ is such that $\lim_{t \to 0} m_t = m$ and $m_t(\mathbb{R}) = 0$ for all t, the topology on $\mathcal{M}(\mathbb{R})$
being the one induced by the norm $\|m\| = \sup_{t \in \mathbb{R}} |m((-\infty, t])|$. If T is Hadamard differentiable
at p, the variance of $n^{1/2} T(\epsilon_n)$ tends to

(2.5) $\sigma^2 = E_p \phi_p^2$

as $n \to \infty$, where $\phi_p(x)$ is the influence function

(2.6) $\phi_p(x) = \partial T_p(\delta(x) - p)$

(this can be shown via the Delta Method using Donsker's theorem (van der Vaart (1998))). The
infinitesimal jackknife estimator (Jaeckel (1972)) of $\sigma^2$ is obtained by substituting the empirical
measure $\epsilon_n$ for p in (2.5):

(2.7) $v_{ijack} = E_{\epsilon_n} \phi_{\epsilon_n}^2$.
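
When no closed form for the influence function is at hand, (2.6) and (2.7) can be approximated by a finite difference in the direction $\delta(x_i) - \epsilon_n$. The sketch below is an illustration under stated assumptions rather than a method from the original note: the statistic is assumed to accept explicit weights on the data points, the step size `t` is an arbitrary small number, and the names `infinitesimal_jackknife` and `squared_mean` are hypothetical.

```python
from typing import Callable, List, Sequence


def infinitesimal_jackknife(x: Sequence[float],
                            weighted_stat: Callable[[Sequence[float], List[float]], float],
                            t: float = 1e-6) -> float:
    """Approximate v_ijack = E_{eps_n} phi_{eps_n}^2 by one-sided finite differences."""
    n = len(x)
    base = weighted_stat(x, [1.0 / n] * n)
    phi = []
    for i in range(n):
        # Weights of the perturbed measure eps_n + t * (delta(x_i) - eps_n).
        w = [(1.0 - t) / n] * n
        w[i] += t
        # Difference quotient approximating the influence function at x_i.
        phi.append((weighted_stat(x, w) - base) / t)
    return sum(p * p for p in phi) / n


def squared_mean(x: Sequence[float], w: List[float]) -> float:
    """Illustrative functional T(m) = (integral of x dm)^2 on weighted point masses."""
    return sum(wi * xi for wi, xi in zip(w, x)) ** 2


if __name__ == "__main__":
    print(infinitesimal_jackknife([2.0, 4.0, 4.0, 5.0, 7.0, 9.0], squared_mean))
```

For the squared mean the influence function is $2\bar{m}(x - \bar{m})$, so the output should be close to four times the squared sample mean times the divisor-n sample variance.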

3 Functions of the mean 

When q is a measure, we denote $\int x\, q(dx)$ by $\bar{q}$ if the integral is defined. Let $g \in C^1(\mathbb{R})$ and let

$T(m) = g(\bar{m})$

be defined for all finite signed measures m with finite first moment. The functional derivative
at m of T, evaluated at q, is $\partial T_m(q) = g'(\bar{m})\bar{q}$; the influence function (2.6) is
$\phi_m(x) = g'(\bar{m})(x - \bar{m})$. Suppose that $x_1, x_2, \ldots$ are iid p, and p has a finite second moment. Let $T_n$
denote the plug-in estimator defined in (2.2). Then the asymptotic variance of $n^{1/2}(T_n - T(p))$
is

(3.1) $\sigma^2 = g'(\bar{p})^2 \left[ \int x^2\, p(dx) - \bar{p}^2 \right]$.

Let $v_{jack}$ and $v_{ijack}$ denote the jackknife and infinitesimal jackknife variance estimates of $\sigma^2$.
Proposition 3.1 If g' is Hölder continuous of order h > 1/2 (with global Hölder constant) and
p has a finite moment of order 2 + 2h then

$v_{jack} - v_{ijack} = O_p(n^{-h})$.

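Proof: Setting $\Delta_{ni} = (Q_{ni} - \bar{Q}_n) - \phi_{\epsilon_n}(x_i)$, formula (2.4) for $v_{jack}$ yields

(3.2) $v_{jack} = E_{\epsilon_n}\phi_{\epsilon_n}^2 + \frac{1}{n-1} E_{\epsilon_n}\phi_{\epsilon_n}^2 + \frac{2}{n-1} \sum_{i=1}^{n} \phi_{\epsilon_n}(x_i) \Delta_{ni} + \frac{1}{n-1} \sum_{i=1}^{n} \Delta_{ni}^2$.

The second term on the right hand side of (3.2) is $O_p(1/n)$ since

$E_{\epsilon_n}\phi_{\epsilon_n}^2 = g'(\bar{\epsilon}_n)^2 \left[ \frac{1}{n} \sum_{i=1}^{n} x_i^2 - \left( \frac{1}{n} \sum_{j=1}^{n} x_j \right)^2 \right]$

converges almost surely to $\sigma^2$.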

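To control the last two terms on the right hand side of (3.2) we need a bound on $\Delta_{ni}$. Recall
the notation $\epsilon_{ni}$ of (2.3). Since g is differentiable, $g(\bar{\epsilon}_{nj}) - g(\bar{\epsilon}_{ni}) = g'(\eta_{ji})(\bar{\epsilon}_{nj} - \bar{\epsilon}_{ni})$ for some
$\eta_{ji}$ between $\bar{\epsilon}_{ni}$ and $\bar{\epsilon}_{nj}$, so that

$Q_{ni} - \bar{Q}_n = \frac{n-1}{n} \sum_{j=1}^{n} \left( g(\bar{\epsilon}_{nj}) - g(\bar{\epsilon}_{ni}) \right) = \frac{n-1}{n} \sum_{j=1}^{n} g'(\eta_{ji}) (\bar{\epsilon}_{nj} - \bar{\epsilon}_{ni})$.

Therefore, since $\phi_{\epsilon_n}(x_i) = g'(\bar{\epsilon}_n)(x_i - \bar{\epsilon}_n) = \frac{1}{n} \sum_{j=1}^{n} g'(\bar{\epsilon}_n)(x_i - x_j)$,

$\Delta_{ni} = (Q_{ni} - \bar{Q}_n) - \phi_{\epsilon_n}(x_i) = \frac{n-1}{n} \sum_{j=1}^{n} g'(\eta_{ji}) (\bar{\epsilon}_{nj} - \bar{\epsilon}_{ni}) - \frac{1}{n} \sum_{j=1}^{n} g'(\bar{\epsilon}_n)(x_i - x_j)$

$= \frac{1}{n} \sum_{j=1}^{n} \left( g'(\eta_{ji}) - g'(\bar{\epsilon}_n) \right) (x_i - x_j)$,

where the last step uses $(n-1)(\bar{\epsilon}_{nj} - \bar{\epsilon}_{ni}) = x_i - x_j$.

But g' is Hölder continuous of order h and $|\eta_{ji} - \bar{\epsilon}_n| \le \max\{|\bar{\epsilon}_{ni} - \bar{\epsilon}_n|, |\bar{\epsilon}_{nj} - \bar{\epsilon}_n|\}$, so the identity
$(n-1)(\epsilon_n - \epsilon_{ni}) = \delta(x_i) - \epsilon_n$ implies that

$|g'(\eta_{ji}) - g'(\bar{\epsilon}_n)| \le C \left( |\bar{\epsilon}_n - \bar{\epsilon}_{nj}|^h + |\bar{\epsilon}_n - \bar{\epsilon}_{ni}|^h \right) \le C (n-1)^{-h} \left( |\bar{\epsilon}_n - x_j|^h + |\bar{\epsilon}_n - x_i|^h \right)$,

where C is a global Hölder constant for g'. It follows that

$|\Delta_{ni}| \le C (n-1)^{-h} \frac{1}{n} \sum_{j=1}^{n} \left( |\bar{\epsilon}_n - x_j|^h + |\bar{\epsilon}_n - x_i|^h \right) \left( |\bar{\epsilon}_n - x_j| + |\bar{\epsilon}_n - x_i| \right)$.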
H i=i 
With this bound on $\Delta_{ni}$, and assuming that p has a finite moment of order 2(1 + h), it may be
shown that

$\frac{1}{n} \sum_{i=1}^{n} \Delta_{ni}^2 = O_p(n^{-2h})$

and then, by the Cauchy-Schwarz inequality, that

$\left| \frac{1}{n} \sum_{i=1}^{n} \phi_{\epsilon_n}(x_i) \Delta_{ni} \right| = O_p(n^{-h})$.

Substituting the preceding estimates in (3.2) completes the proof. □
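
The decomposition (3.2) is an exact algebraic identity, so it can be checked numerically for any data set and any differentiable g. The following sketch does so for functions of the mean; it is an illustration only, the name `decomposition_terms` is hypothetical, and the leave-one-out means are computed from the identity $(n-1)\bar{\epsilon}_{ni} = n\bar{\epsilon}_n - x_i$.

```python
import math
from typing import Callable, List, Sequence, Tuple


def decomposition_terms(x: Sequence[float],
                        g: Callable[[float], float],
                        g_prime: Callable[[float], float]) -> Tuple[float, List[float]]:
    """Return v_jack and the four right-hand terms of (3.2) for T(m) = g(mean of m)."""
    n = len(x)
    mean = sum(x) / n
    # Leave-one-out means, i.e. the means of eps_ni in (2.3).
    loo = [(n * mean - xi) / (n - 1) for xi in x]
    t_n = g(mean)
    q = [n * t_n - (n - 1) * g(m) for m in loo]          # pseudovalues Q_ni
    q_bar = sum(q) / n
    phi = [g_prime(mean) * (xi - mean) for xi in x]      # phi_{eps_n}(x_i)
    delta = [(qi - q_bar) - p for qi, p in zip(q, phi)]  # Delta_ni
    v_jack = sum((qi - q_bar) ** 2 for qi in q) / (n - 1)
    v_ijack = sum(p * p for p in phi) / n                # E_{eps_n} phi^2
    terms = [
        v_ijack,
        v_ijack / (n - 1),
        2.0 / (n - 1) * sum(p * d for p, d in zip(phi, delta)),
        sum(d * d for d in delta) / (n - 1),
    ]
    return v_jack, terms


if __name__ == "__main__":
    data = [0.3, -1.2, 2.5, 0.7, 1.9, -0.4, 3.1]
    v_jack, terms = decomposition_terms(data, math.sin, math.cos)
    print(v_jack, sum(terms))
```

The two printed numbers should agree up to rounding error; the first entry of `terms` is $v_{ijack}$, so the remaining three entries sum to $v_{jack} - v_{ijack}$.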

Consider $g(x) = x - \mathrm{sgn}(x) x^2$. This function has a Lipschitz continuous derivative but does
not have a second-order derivative at 0. By Proposition 3.1,

$v_{jack} - v_{ijack} = O_p(1/n)$.

However, for some population distributions having mean 0, one can prove that $v_{ijack}$ is not
asymptotically normal, and simulations suggest that $v_{jack} - v_{boot}$ is $O_p(1/\sqrt{n})$ rather than
$O_p(1/n)$. This example shows that $v_{jack}$ is more closely related to $v_{ijack}$ than it is to $v_{boot}$.
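
The first claim of this example can be explored with a small simulation. The sketch below is not the simulation referred to in the text: the standard normal population, the seed, and the sample sizes are assumptions chosen for illustration, and the function name `jack_and_ijack` is hypothetical. It prints $n(v_{jack} - v_{ijack})$, which should stay of moderate size as n grows if the difference is $O_p(1/n)$.

```python
import random
from typing import Sequence, Tuple


def g(x: float) -> float:
    """g(x) = x - sgn(x) * x^2, whose derivative is Lipschitz but not differentiable at 0."""
    sign = 1.0 if x > 0 else (-1.0 if x < 0 else 0.0)
    return x - sign * x * x


def g_prime(x: float) -> float:
    return 1.0 - 2.0 * abs(x)


def jack_and_ijack(x: Sequence[float]) -> Tuple[float, float]:
    """Return (v_jack, v_ijack) for the statistic g(sample mean)."""
    n = len(x)
    mean = sum(x) / n
    t_n = g(mean)
    # Pseudovalues, using the closed form (n*mean - x_i)/(n-1) for leave-one-out means.
    q = [n * t_n - (n - 1) * g((n * mean - xi) / (n - 1)) for xi in x]
    q_bar = sum(q) / n
    v_jack = sum((qi - q_bar) ** 2 for qi in q) / (n - 1)
    v_ijack = g_prime(mean) ** 2 * sum((xi - mean) ** 2 for xi in x) / n
    return v_jack, v_ijack


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, assumed only for reproducibility
    for n in (100, 1000, 10000):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        vj, vi = jack_and_ijack(sample)
        print(n, n * (vj - vi))
```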



4 Trimmed L-statistics 

Suppose that $\ell : (0, 1) \to \mathbb{R}$ is supported on $[\alpha, 1 - \alpha]$ for some $0 < \alpha < 1/2$, and let

(4.1) $L(p) = \int_0^1 P^{-1}(s)\, \ell(s)\, ds$.

Here $P^{-1}$ denotes the quantile function for p, i.e., $P^{-1}(s) = \min\{x : P(x) \ge s\}$ for 0 < s < 1
where P denotes the cdf of p. A plug-in estimate for L is called a trimmed L-statistic, or a
trimmed linear combination of quantiles.
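
For the empirical measure, the quantile function in (4.1) equals the i-th order statistic on the interval ((i-1)/n, i/n], so the plug-in estimate is a weighted sum of order statistics. The following sketch illustrates this; the trapezoidal weight `mesa`, its parameters `alpha` and `beta`, its normalization to unit integral, and the midpoint quadrature with `m` subintervals are all assumptions made for the example.

```python
from typing import Callable, Sequence


def mesa(s: float, alpha: float = 0.1, beta: float = 0.3) -> float:
    """Hypothetical piecewise-linear weight supported on [alpha, 1 - alpha], integrating to 1."""
    if s <= alpha or s >= 1.0 - alpha:
        return 0.0
    height = min(1.0, (s - alpha) / (beta - alpha), (1.0 - alpha - s) / (beta - alpha))
    return height / (1.0 - alpha - beta)  # trapezoid area is 1 - alpha - beta


def l_statistic(x: Sequence[float], weight: Callable[[float], float], m: int = 200) -> float:
    """Plug-in estimate of (4.1): sum over i of x_(i) times the integral of the weight over ((i-1)/n, i/n]."""
    xs = sorted(x)
    n = len(xs)
    h = 1.0 / (n * m)
    total = 0.0
    for i, xi in enumerate(xs):
        lo = i / n
        # Midpoint rule for the integral of the weight over the i-th cell.
        cell = sum(weight(lo + (k + 0.5) * h) for k in range(m)) * h
        total += xi * cell
    return total


if __name__ == "__main__":
    print(l_statistic([5.0, 1.0, 3.0, 2.0, 4.0], mesa))
```

Because the weight is symmetric about 1/2 and integrates to one, the estimate for data that are symmetric about a point c should be c up to quadrature and rounding error.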

If the weight function $\ell$ is continuous then L is Hadamard differentiable at all $p \in \mathcal{P}(\mathbb{R})$ (see,
e.g., Lemma 22.10 of van der Vaart (1998)), and so the L-statistics are asymptotically normal
(an original reference is Stigler (1974)). The asymptotic variance $\sigma^2$ of the L-statistics may be
estimated by $v_{jack}$, which converges almost surely to $\sigma^2$ if $\ell$ is continuous (Parr (1985), Shao and
Tu (1995)). The jackknife and infinitesimal jackknife would seem to be the only nonparametric
methods of consistent variance estimation for L-statistics, aside from the bootstrap (Schucany
and Parr (1982)).

We turn now to the question of the asymptotic normality of $v_{jack}$. In this regard, a variant
of the L-functional (4.1) has been treated in the literature, namely

(4.2) $\mathcal{L}(p) = \int x\, \ell(P(x))\, p(dx)$.

If P is continuous and strictly increasing then $\mathcal{L}$ of (4.2) equals L of (4.1). Beran (1984) proves
that $v_{jack}$ for $\mathcal{L}$ is asymptotically normal, and so is $v_{boot}$, if $\ell$ is continuously differentiable
and p has bounded support. Section 2.2.3 of Shao and Tu (1995) incorrectly claims that $v_{jack}$
for $\mathcal{L}$ is asymptotically normal if $\ell$ is Hölder continuous of order greater than 1/2, and it also
wrongly claims that the asymptotic variance equals $\mathrm{Var}(\phi_p)$, where $\phi_p$ is the influence function
of $\mathcal{L}$. A detailed discussion of those errors is given in an unpublished technical report (Gottlieb
(2001)). Nevertheless, a reworking of Definition 2.6 and Theorem 2.7 in Shao and Tu (1995)
leads us to the following general proposition, which will presently be applied to the case where
the $T(\epsilon_n)$ are L-statistics:

Proposition 4.1 Let $\epsilon_n$ denote the empirical distribution of n iid samples from p, and let $v_{jack}$
and $v_{ijack}$ denote the jackknife and infinitesimal jackknife estimates of the variance of $T(\epsilon_n)$.

Let $\|q' - q\|$ denote the supremum of the absolute value of the difference between the cdf's of
q' and q. Suppose that there exist positive constants C, $\delta$, and h such that

(4.3) $T(q') = T(q) + \partial T_q(q' - q) + \mathcal{R}(q', q)$

for all q', q with $\|q' - p\|, \|q - p\| < \delta$, where the remainder $|\mathcal{R}(q', q)| \le C \|q' - q\|^{1+h}$. Then

$v_{jack} - v_{ijack} = O_p(n^{-h})$

if $v_{ijack}$ is bounded in probability.

The straightforward proof of this proposition proceeds like the proof of Proposition 3.1 above,
except that (4.3) is used to bound $\Delta_{ni}$ in (3.2).

Now, let $\mathcal{L}$ be a trimmed L-functional of the form (4.2) whose weight function $\ell$ is Hölder
continuous of order h. Upon integrating the right hand side of (4.2) by parts, it becomes easy
to verify that $\mathcal{L}$ admits the expansion (4.3) near any p. Since $v_{ijack}$ converges almost surely,
Proposition 4.1 implies that $v_{jack}$ and $v_{ijack}$ are asymptotically equivalent.

This equivalence allows us to conclude that $v_{jack}$ is asymptotically normal for many
L-functionals, for Gardiner and Sen (1979) have found hypotheses that guarantee the asymptotic
normality of $v_{ijack}$ for generalized L-functionals of the form (4.1). They begin by assuming that
the cdf P of the population distribution is continuous. In this case

$v_{ijack} = \int \int \ell(P_n(y)) \left[ P_n(y \wedge z) - P_n(y) P_n(z) \right] \ell(P_n(z))\, dy\, dz$.
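
The double integral above reduces to a finite double sum, assuming that $P_n$ denotes the empirical cdf: $P_n$ equals a/n between the a-th and (a+1)-th order statistics, and the bracketed factor vanishes outside the range of the data. The sketch below is an illustration under that assumption, with the hypothetical name `ijack_l_statistic`.

```python
from typing import Callable, Sequence


def ijack_l_statistic(x: Sequence[float], weight: Callable[[float], float]) -> float:
    """Evaluate the double integral for v_ijack with P_n taken to be the empirical cdf."""
    xs = sorted(x)
    n = len(xs)
    # gaps[a - 1] = x_(a+1) - x_(a): length of the interval on which P_n = a/n.
    gaps = [xs[a] - xs[a - 1] for a in range(1, n)]
    total = 0.0
    for a in range(1, n):
        for b in range(1, n):
            # P_n(y ^ z) - P_n(y) P_n(z) on the cell where P_n(y) = a/n and P_n(z) = b/n.
            cov = min(a, b) / n - (a * b) / (n * n)
            total += weight(a / n) * cov * weight(b / n) * gaps[a - 1] * gaps[b - 1]
    return total


if __name__ == "__main__":
    data = [0.3, -1.2, 2.5, 0.7, 1.9]
    print(ijack_l_statistic(data, lambda s: 1.0))
```

With a constant weight equal to one, which is outside the trimmed setting but convenient as a check, the sum reduces to the divisor-n sample variance of the data.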



In order to make contact with the work of Gardiner and Sen (1979), let us suppose that P
is continuous and strictly increasing. Their hypotheses are general enough to apply to
non-trimmed L-statistics, but too complicated to be repeated here. Suffice it to say that their
theorem applies under our current assumptions that $\ell$ is trimmed and that P has no jumps
or flats, if it is assumed in addition that P does not have very heavy tails and that $\ell$ is
piecewise continuously differentiable with Hölder continuity of order greater than 1/2 at the
cusps. (Imagine, for example, a piecewise-linear $\ell$ whose graph is shaped like a desert mesa; this
is one of the weight functions recommended in Stigler (1973) for smoothly trimmed means.) In
these cases $v_{jack}$ is asymptotically normal as well, by Proposition 4.1.